ENTREVIS - a Spanish machine-readable text corpus
نویسندگان
چکیده
منابع مشابه
Machine-Readable Dictionaries in Text-to-Speech Systems
This paper presents the results of an experiment usiug machine-readable dictionaries (Mill)s) and corpora for building concatenativc units for text to speech (T'PS) systems. Theoretical questions concerning the nature of t)honemic data in dictionaries are raised; phonemic dictionary data is viewed as a representative corpus over which to extract n-gram phonemic frequencies in the language. Dict...
متن کاملEuskoParl: a speech and text Spanish-Basque parallel corpus
The advances in corpus-based approaches and machine learning techniques have promoted the development of minority languages. The contribution of this work is to acquire a parallel corpus in Spanish and Basque with both text and speech data. In order to be able to compare the systems with those developed for other languages, Europarl corpus was taken as a reference in both domain and size. The a...
متن کاملA Large Spanish-Catalan Parallel Corpus Release for Machine Translation
We present a large Spanish-Catalan parallel corpus extracted from ten years of the paper edition of a bilingual Catalan newspaper. The produced corpus of 7.5 M parallel sentences (around 180 M words per language) is useful for many natural language applications. We report excellent results when building a statistical machine translation system trained on this parallel corpus. The Spanish-Catala...
متن کاملMachine-Readable Dictionaries
The papers in this panel consider machine-readable dictionaries from several perspectives: research in computational linguistics and computational lexicology, the development of tools for improving accessibility, the design of lexical reference systems for educational purposes, and applications of machine-readable dictionaries in information science contexts. As background and by way of introdu...
متن کاملMachine-Readable Dictionary Headwords
(5) The idea behind dictionaries for machine processing and that behind dictionaries that people can thumb through are di erent, so we have to be aware that di erent terms are used. It is often best to include in machine dictionaries even terms that are easy for people to understand and to break down into their key elements. For example, :n6HCf (on duty, at work) and F17nKv (the end of the same...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: HERMES - Journal of Language and Communication in Business
سال: 2015
ISSN: 0904-1699
DOI: 10.7146/hjlcb.v4i7.21476